The spectrum of genomic signatures: from dinucleotides to chaos game representation.
Authors
Abstract
In the post-genomic era, access to complete genome sequence data for numerous diverse species has opened multiple avenues for examining and comparing primary DNA sequence organization of entire genomes. Previously, the concept of a genomic signature was introduced with the observation of species-type specific Dinucleotide Relative Abundance Profiles (DRAPs); dinucleotides were identified as the subsequences with the greatest bias in representation in a majority of genomes. Herein, we demonstrate that DRAP is one particular genomic signature contained within a broader spectrum of signatures. Within this spectrum, an alternative genomic signature, Chaos Game Representation (CGR), provides a unique visualization of patterns in sequence organization. A genomic signature is associated with a particular integer order or subsequence length that represents a measure of the resolution or granularity in the analysis of primary DNA sequence organization. We quantitatively explore the organizational information provided by genomic signatures of different orders through different distance measures, including a novel Image Distance. The Image Distance and other existing distance measures are evaluated by comparing the phylogenetic trees they generate for 26 complete mitochondrial genomes from a diversity of species. The phylogenetic tree generated by the Image Distance is compatible with the known relatedness of species. Quantitative evaluation of the spectrum of genomic signatures may ultimately be used to gain insight into the determinants and biological relevance of the genomic signatures.
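As a rough illustration of the two signatures named in the abstract, the sketch below computes a dinucleotide relative abundance profile and a CGR count grid of a given order. It is not the paper's implementation. The function names, the corner assignment for A, C, G and T, and the single-strand form of the abundance ratio (observed dinucleotide frequency divided by the product of the two mononucleotide frequencies) are assumptions made for illustration. The paper's Image Distance is not reproduced here.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
# Assumed corner layout of the unit square; other layouts are equally valid.
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}


def dinucleotide_relative_abundance(seq):
    """Return rho[xy] = f_xy / (f_x * f_y) for all 16 dinucleotides.

    Single-strand simplification: the sequence is not combined with its
    reverse complement. Pairs containing a non-ACGT symbol are ignored.
    """
    seq = seq.upper()
    mono = Counter(b for b in seq if b in BASES)
    pairs = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in BASES and seq[i + 1] in BASES
    )
    n_mono = sum(mono.values())
    n_pairs = sum(pairs.values())
    if n_mono == 0 or n_pairs == 0:
        raise ValueError("sequence needs at least two adjacent A/C/G/T bases")
    rho = {}
    for x, y in product(BASES, repeat=2):
        fx = mono[x] / n_mono
        fy = mono[y] / n_mono
        fxy = pairs[x + y] / n_pairs
        # Undefined when one of the bases never occurs in the sequence.
        rho[x + y] = fxy / (fx * fy) if fx > 0 and fy > 0 else float("nan")
    return rho


def cgr_points(seq):
    """Chaos game: start at the centre, move halfway to each base's corner."""
    x, y = 0.5, 0.5
    points = []
    for base in seq.upper():
        if base not in CORNERS:
            continue  # simplification: ambiguous symbols are skipped
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


def cgr_grid(seq, order):
    """Count CGR points in a 2**order by 2**order grid (rows index y)."""
    size = 2 ** order
    grid = [[0] * size for _ in range(size)]
    # The first order-1 points do not yet encode a full subsequence.
    for x, y in cgr_points(seq)[order - 1:]:
        col = min(int(x * size), size - 1)
        row = min(int(y * size), size - 1)
        grid[row][col] += 1
    return grid
```

Each cell of the order-k grid collects the points whose last k moves were the same, so the grid is a spatial arrangement of subsequence counts of length k. Raising the order gives the finer resolution that the abstract describes.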
Similar resources
Self-Similarity Limits of Genomic Signatures
It is shown that metric representation of DNA sequences is one-to-one. By using the metric representation method, suppression of nucleotide strings in the DNA sequences is determined. For a DNA sequence, an optimal string length to display genomic signature in chaos game representation is obtained by eliminating effects of the finite sequence. The optimal string length is further shown as a sel...
Systematic CpT (ApG) Depletion and CpG Excess Are Unique Genomic Signatures of Large DNA Viruses Infecting Invertebrates
Differences in the relative abundance of dinucleotides, if any, may provide important clues on host-driven evolution of viruses. We studied dinucleotide frequencies of large DNA viruses infecting vertebrates (n = 105; viruses infecting mammals = 99; viruses infecting aves = 6; viruses infecting reptiles = 1) and invertebrates (n = 88; viruses infecting insects = 84; viruses infecting crustaceans...
Representation of Gender Roles in Child and Young Characters in Game of Thrones Series
The purpose of this study is to demonstrate how we, especially children and adolescents, are influenced by the media; this issue was investigated by analyzing the representation of gender roles in the Game of Thrones series. The study was based on three theories: social learning, socialization, and cultivation. The research used a quantitative content analysis. The variables included 20 gender a...
Secondary Structural Analysis of Families of Protein Sequences using Chaos Game Representation
CGR is an effective method for visualizing any structural features if it is given as a sequence of elements [1,2] analyzed by the genomic signature appears as a powerful tool for investigating the mechanisms of DNA maintenance from which the DNA structure results. It would be necessary to understand the patterns they exhibit and to be able to interpret them in a biologically meaningful way [3]....
Conservative chaotic flow generated via a pseudo-linear system
Analysis of nonlinear autonomous systems has been a popular field of study in recent decades. As an interesting nonlinear behavior, chaotic dynamics has been intensively investigated since Lorenz discovered the first physical evidence of chaos in his famous equations. Although many chaotic systems have been reported in the literature, a systematic and qualitative approach for chaos generat...
Journal title:
- Gene
Volume 346, Issue
Pages -
Publication date 2005